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ABSTRACT 


In  the  analysis  of  radar  signal  processing,  many  approximations  are  made  in 
order  to  simplify  systems  analysis.  In  the  context  of  radar  detection,  environ¬ 
mental  assumptions,  such  as  in  the  modelling  of  clutter  and  noise,  are  often 
made.  Similar  assumptions  are  employed  in  the  analysis  of  speckle  in  radar 
imaging  systems.  In  addition  to  this,  further  statistical  approximations  of  ran¬ 
dom  processes  of  interest  are  made.  In  radar  detection  theory,  an  assumption 
on  the  distribution  of  a  clutter  statistic  is  often  made.  An  important  part 
of  the  model  validation  process  is  to  assess  the  validity  of  employed  assump¬ 
tions.  The  validity  of  environmental  assumptions  is  often  assessed  on  the  basis 
of  field  trials.  In  the  case  of  approximations  of  random  processes  of  interest, 
there  are  a  number  of  approaches.  This  study  introduces  a  general  scheme, 
known  as  Stein’s  Method,  for  assesssing  distributional  approximations  of  ran¬ 
dom  processes.  It  is  described  in  general  terms,  and  its  application  to  the 
Poisson  and  Gaussian  approximation  of  a  given  random  variable  is  outlined.  A 
new  development  of  Stein’s  Method  for  Exponential  approximation  is  included. 
The  importance  of  the  latter  is  that  it  can  be  used  as  an  approximating  distri¬ 
bution  for  a  number  of  random  variables  of  interest  in  radar. 
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Stein’s  Method  and  its  Application  in  Radar  Signal 

Processing 


EXECUTIVE  SUMMARY 


An  important  component  of  the  ongoing  DSTO  efforts  in  Radar  Modelling  and  Analysis 
Group  is  model  validation  and  verification.  As  a  part  of  the  efforts  in  a  number  of  DSTO 
Tasks  (AIR  01/217,  AIR  02/232,  NAV  01/299),  radar  clutter  modelling  and  analysis  work 
has  been  undertaken.  Probability  distributions,  such  as  Rayleigh,  Weibull  and  the  K, 
have  been  found  to  be  appropriate  for  the  modelling  of  radar  clutter,  in  various  scenarios. 
Such  modelling  approximations  for  clutter  can  be  validated  via  empirical  studies  through 
trials  data.  In  the  area  of  radar  detection,  further  assumptions  are  made  in  the  design 
of  a  radar  detection  scheme.  In  constant  false  alarm  rate  (GEAR)  detectors,  a  function 
computing  a  measure  of  the  clutter  level  is  used.  In  order  to  assess  the  performance  of  a 
particular  scheme,  statistical  assumptions  are  either  imposed  on  this  measure,  or  Monte 
Garlo  estimation  is  necessary.  There  may  in  fact  be  sufficient  grounds  to  assume  that  a 
clutter  measure,  in  a  given  environment,  has  a  particular  statistical  distribution.  If  such  an 
approximation  was  valid,  it  could  improve  the  estimation  of  radar  performance  measures 
such  as  the  false  alarm  and  detection  probabilities. 

Hence,  it  is  desirable  to  be  able  to  assess  the  validity  of  such  approximations.  The  Stein- 
Ghen  Method,  commonly  referred  to  as  Stein’s  Method,  is  a  general  scheme  whereby 
such  distributional  approximations  can  be  assesssed  mathematically.  The  scheme  allows 
a  bound  to  be  constructed,  on  the  distributional  approximation  of  one  random  variable 
by  another.  Stein’s  Method  is  illustrated  in  the  case  of  the  Poisson  and  Gaussian  approx¬ 
imation  of  random  variables.  The  latter  distributions  are  frequently  used  in  radar,  and  so 
these  results  may  be  applicable  to  radar  performance  analysis. 

A  new  development  of  Stein’s  Method,  for  measuring  the  distributional  approximation  of 
a  random  variable  by  an  Exponential  distribution,  is  included.  This  work  has  direct  appli¬ 
cation  to  a  number  of  distributions  of  interest  in  radar.  In  particular,  the  time  to  detection 
in  a  radar  detection  scheme  can  be  approximated  by  such  a  distribution.  Additionally, 
some  processes  in  radar  image  speckle  modelling  are  asymptotically  Exponential. 

The  purpose  of  this  report  is  to  introduce  the  mathematical  framework  for  further  work 
on  the  application  of  Stein’s  Method  to  radar  systems  analysis. 
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1  Introduction 


A  core  responsibility  of  Electronic  Warfare  and  Radar  Division’s  Radar  Modelling  and 
Analysis  Group  is  to  provide  technical  advice  to  the  Australian  Defence  Force  on  the 
performance  of  radar  systems.  This  work  has  required  the  analysis  and  evaluation  of 
radar  systems  in  different  environmental  scenarios.  As  a  result  of  this,  many  studies  have 
been  undertaken  to  classify  the  types  of  clutter  where  Australian  assets  are  likely  to  be 
deployed.  This  has  resulted  in  the  studies  of  [Antipov  1998,  2001],  which  were  undertaken 
as  part  of  the  Task  AIR  98/084,  which  has  been  succeeded  by  AIR  01/217.  This  Task  was 
designed  to  provide  the  RAAF  with  advice  on  the  performance  of  the  Israeli  designed  Elta 
EL/M-2022  maritime  radar,  used  in  the  AP-3C  Orion  fleet.  Also  within  the  Microwave 
Radar  Branch,  [Dong  2003,  2004]  has  undertaken  studies  of  clutter  modelling,  in  support 
of  the  Task  AIR  02/232.  The  latter  is  to  provide  technical  support  for  the  Airborne  Early 
Warning  and  Control  (AEW&C)  acquisition  under  Project  Wedgetail.  A  key  component 
in  all  these  studies  of  clutter  is  the  search  for  a  good  mathematical  model.  In  some  cases, 
Rayleigh,  Weibull  and  the  K-distributions  have  been  found  to  fit  clutter  well  [Antipov 
1998,  2001  and  Dong  2004].  These  clutter  models  are  validated  through  trials.  Models 
of  the  performance  of  radars  are  also  validated  by  comparing  a  model’s  behaviour,  in  a 
prescribed  clutter  scenario,  to  the  radar’s  performance  in  a  similar  real  environment. 

In  the  development  of  mathematical  models  of  radar  performance,  many  assumptions  are 
made.  As  an  example,  in  the  case  of  radar  detectors,  a  constant  false  alarm  rate  (CFAR) 
detector  [Levanon  1988  and  Minkler  and  Minkler  1990]  estimates  the  level  of  clutter,  in 
order  to  set  an  appropriate  detection  threshold  for  a  desired  probability  of  false  alarm. 
This  measure  of  clutter  may  be  a  simple  arithmetic  mean,  or  a  complex  function  of  other 
statistics  of  the  clutter.  A  key  issue  with  the  design  of  an  effective  CFAR  scheme  is  the  best 
choice  for  the  clutter  measure.  The  latter  will  change  depending  on  the  type  of  clutter  the 
radar  is  processing.  For  a  given  clutter  scenario,  the  performance  of  a  CFAR  thus  needs 
to  be  evaluated.  Typical  performance  measures  used  are  the  probability  of  detection  and 
false  alarm.  To  evaluate  these,  it  is  often  necessary  to  make  statistical  approximations. 
Monte  Carlo  Methods  are  often  used,  due  to  analytic  intractibility  of  these  probabilities. 
However,  in  some  cases,  it  may  be  possible  to  apply  a  distributional  approximation  to  the 
clutter  measure  before  applying  Monte  Carlo  techniques.  The  latter  has  the  potential  to 
speed  up  convergence  of  estimators.  Thus  it  would  be  useful  to  have  a  framework  that 
allows  the  errors  in  such  approximations  to  be  estimated. 

The  purpose  of  this  report  is  to  introduce  a  scheme  that  allows  the  mathematical  analysis 
of  such  approximations  to  be  assessed.  This  scheme,  known  as  Stein’s  Method,  will  be 
discussed  in  considerable  detail.  Its  application  to  a  number  of  radar-related  problems  will 
also  be  outlined  in  general  terms. 


1.1  Stein’s  Method 


A  significant  component  of  the  theory  of  probability  is  concerned  with  the  convergence 
of  random  variables  and  probability  distributions  [see  Billingsley  1986,  Durrett  1996  and 
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Ross  1996].  From  a  practical  point  of  view,  a  modelling  application  of  stochastic  processes 
will  often  necessitate  the  estimation  of  probabilities  of  interest.  An  example  of  this  is  the 
analysis  in  [Weinberg  1999]  on  the  modelling  of  stochastic  queueing  systems.  Estimating 
probabilities  is  a  part  of  the  more  general  problem  of  approximate  computation  of  expecta¬ 
tions^.  The  work  presented  here  is  an  examination  of  the  methods  introduced  by  Charles 
Stein,  and  his  former  student  Louis  Chen.  Although  the  focus  of  the  work  presented 
here  is  the  mathematics  behind  Stein’s  Method,  its  potential  application  to  radar  related 
problems  will  also  be  outlined. 

In  [Stein  1972]  a  general  scheme  was  introduced,  which  provided  a  new  proof  of  the  classical 
Central  Limit  Theorem  of  statistics.  The  techniques  provided  Berry-Esseen  type  bounds 
[Esseen  1945,  Eeller  1968  and  Chen  and  Shao  2003]  on  the  Gaussian  distributional  approx¬ 
imation  of  a  sum  of  random  variables.  In  previous  work,  tools  such  as  Eourier  analysis 
were  used  to  derive  such  estimates  [Esseen  1945].  Stein’s  techniques  differed  because  they 
used  a  characterisation  of  the  Gaussian  distribution  to  construct  error  rates. 

Stein  found  a  first  order  linear  differential  equation  could  be  used  to  characterise  the 
Gaussian  distribution.  This  converted  the  problem  of  measuring  distributional  differences 
to  one  of  bounding  an  expectation.  This  provided  a  new  way  of  assessing  distributional 
approximations.  One  of  the  interesting  features  of  this  method  is  that  it  is  rather  general. 
[Chen  1975]  was  the  first  to  apply  the  technique  to  other  distributions,  and  considered 
Poisson  random  variable  approximations. 

It  was  the  discovery  by  [Barbour  1988]  that  explained  the  generality  of  Stein’s  Method. 
By  a  derivative  transformation  of  the  original  Stein  equation  for  Gaussian  approximation, 
it  was  found  that  part  of  the  Stein  equation  is  the  generator  of  an  Ornstein-Uhlenbeck 
diffusion  process  [Aldous  1989].  This  process  has  the  Gaussian  distribution  as  its  station¬ 
ary  distribution.  In  the  case  of  the  Poisson  Stein  equation,  the  generator  is  that  of  an 
immigration-death  process  on  the  nonnegative  integers  [Ross  1983].  Again,  this  process 
has  as  its  stationary  distribution  the  Poisson  distribution.  The  explanation  for  the  pres¬ 
ence  of  a  generator  in  the  Stein  equation,  and  the  reference  distribution  being  the  processes 
stationary  distribution,  can  be  found  in  [Ethier  and  Kurtz  1986].  Markov  processes  are 
the  key  to  this  generality.  It  can  be  shown  that  a  Markov  process  is  uniquely  charac¬ 
terised  by  its  generator,  and  that  a  distribution  is  the  stationary  distribution  of  a  Markov 
process  if  and  only  if  the  expectation  of  its  generator,  under  the  stationary  distribution, 
is  identically  zero.  The  implication  of  this  is  that  if  one  is  interested  in  developing  a  Stein 
equation  for  a  reference  distribution  //,  then  one  needs  to  find  a  Markov  process  with  p,  as 
a  stationary  distribution.  When  this  is  done,  the  problem  of  measuring  distributional  dif¬ 
ferences  translates  to  one  of  bounding  an  expectation  of  a  generator  of  a  Markov  process. 
This  generalisation  of  Stein’s  Method,  and  the  manipulation  of  the  underlying  generator, 
is  known  as  the  probabilistic  interpretation  or  method. 

The  probabilistic  method  does  not  always  work  well  in  practice.  Compound  Poisson 
approximation  [Barbour  and  Chryssaphinouis  2001]  is  an  example  where  the  probabilistic 
approach  is  problematic.  However,  it  can  always  provide  a  means  of  deriving  an  initial 
Stein  equation  for  applications  of  interest. 

'^This  is  the  title  of  Stein’s  Lecture  Notes  [Stein  1986]. 
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Stein’s  Method  has  been  applied  extensively,  in  a  wide  range  of  contexts.  In  the  Poisson 
setting,  [Barbour,  Holst  and  Janson  1992]  is  an  encyclopaedia  of  applications,  includ¬ 
ing  random  permutations,  random  graphs,  occupancy  and  urn  models,  spacings  and  ex¬ 
ceedances  and  extremes.  Some  other  examples  that  can  be  found  in  the  stochastic  processes 
literature  include  the  work  of  [Barbour  1988]  on  multivariate  Poisson  approximations, 
[Barbour  1990]  on  diffusion  approximations,  [Barbour  and  Brown  1992]  on  Poisson  point 
process  approximation,  [Barbour  and  Brown  1996]  on  the  application  of  Stein’s  Method 
to  networks  of  queues,  [Barbour,  Chen  and  Loh  1992  and  Barbour  and  Chryssaphinouis 
2001]  on  compound  Poisson  approximations,  [Phillips  1996]  on  negative  binomial  approxi¬ 
mations,  [Greig  1995]  on  Erlang  fixed  point  approximations  for  telecommunications  appli¬ 
cations,  [Loh  1992]  on  multinomial  approximations,  [Luk  1994]  on  gamma  approximations 
and  [Pekdz  1996  and  Phillips  and  Weinberg  2000]  on  geometric  approximations. 

The  first  application  of  Stein’s  Method  to  radar  can  be  found  in  [Weinberg  2005].  In  the 
latter,  it  is  shown  that  the  single  pulse  probability  of  detection  of  a  target  in  Gaussian 
clutter  and  noise  is  equivalent  to  a  comparison  of  two  independent  Poisson  random  vari¬ 
ables.  In  order  to  undertake  an  asymptotic  analysis  of  this  detection  probability,  Stein’s 
Method  for  Poisson  approximation  yielded  a  new  expression  for  a  comparison  of  Poisson 
variables.  This  enabled  easy  asymptotic  analysis  of  the  detection  probability.  However, 
the  approach  used  in  [Weinberg  2005]  does  not  easily  indicate  how  Stein’s  Method  is  used 
in  general.  Specifically,  it  does  not  indicate  how  a  distributional  approximation  made  for 
a  clutter  statistic  can  be  evaluated. 

This  report  will  introduce  the  development  of  Stein’s  Method  in  a  number  of  contexts.  The 
case  of  the  Poisson  approximation  of  a  sum  of  indicator  random  variables  will  be  considered 
firstly.  A  bound  on  the  Poisson  approximation  of  a  Binomial  distribution  will  follow  as  a 
consequence.  To  illustrate  the  way  Stein’s  Method  works  for  nonatomic  distributions,  the 
Gaussian  approximation  of  a  sum  of  random  variables  will  be  considered,  and  a  Berry- 
Essen  type  bound  will  be  derived.  A  new  development  of  Exponential  approximation  will 
be  outlined,  together  with  its  usage  and  application  to  potential  problems  in  radar. 

A  major  objective  of  the  report  is  to  disseminate  to  a  wide  audience  the  potential  usage 
and  application  of  Stein’s  Method  in  defence  scientific  research  applications.  Although 
the  applications  pointed  to  here  will  be  exclusively  to  radar,  there  is  scope  to  apply  the 
general  methods  to  communications  and  weapon  systems  modelling. 
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2  Stein’s  Method 


This  section  begins  with  the  development  of  Stein’s  Method  in  general  terms,  using  the 
probabilistic  approach  of  [Barbour  1988].  It  will  be  outlined  how  a  Stein  equation  can  be 
derived  for  a  particular  application.  The  specific  cases  of  Poisson  and  Gaussian  approxi¬ 
mations  will  be  used  as  examples  of  the  usage  of  the  scheme.  The  Gaussian  distribution 
is  of  considerable  importance  in  radar,  and  so  these  results  may  be  of  interest  to  radar 
analysts.  Recent  work  on  the  single  pulse  probability  of  detection  of  a  target  in  Gaussian 
clutter  has  shown  the  importance  of  the  Poisson  distribution  in  radar  [Weinberg  2005]. 
Hence  Poisson  approximation  is  also  of  interest.  Some  of  the  mathematical  notation  and 
definitions  can  be  found  in  Appendix  A. 


2.1  General  Principles 


In  this  Subsection,  Stein’s  Method  in  outlined  in  general  terms.  In  particular,  the  prob¬ 
abilistic  method,  which  associates  a  distribution  with  a  corresponding  Markov  process, 
and  bounds  the  expectation  of  its  generator,  is  introduced.  This  general  development  is 
important  because  it  indicates  how  the  scheme  can  be  extended  to  practical  applications 
of  interest.  However,  the  applications  of  Stein’s  Method  to  the  Poisson,  Gaussian  and 
Exponential  cases  considered  in  this  report,  do  not  directly  require  this  more  abstract 
formulation.  Hence,  the  less  mathematically  inclined  reader  can  start  with  the  specific 
examples  in  Subsections  2.2  and  2.3,  without  reference  to  this  Subsection. 

Suppose  we  have  a  probability  distribution  //,  associated  with  some  random  variable  Z. 
Hence  P(Z'  G  A)  =  n{A),  for  a  measurable  set  A.  We  refer  to  n  as  the  reference  distri¬ 
bution.  Suppose  we  are  interested  in  some  other  random  variable  X,  and  in  particular, 
suppose  we  know  empirically  that  the  distribution  of  X  can  be  approximated  by  /r.  This 
means  that 

JP{X  £  A)^]P{Z  eA)  =  n{A).  (1) 

We  refer  to  the  distribution  of  X  as  the  target  distribution.  From  an  analysis  point  of 
view,  it  is  important  to  determine  whether  the  distributional  approximation  implied  by 

(1)  is  valid.  Hence,  it  would  be  useful  to  be  able  to  get  an  estimate  of  the  difference  of 
distributions 

|P(Ag  A)-P(Zg  A)|.  (2) 

Stein’s  Method  enables  one  to  quantify  the  approximation  in  (1)  by  obtaining  upper  and 
lower  bounds  on  (2). 

The  basic  idea  behind  Stein’s  Method  is  to  find  an  equation  characterising  a  reference 
distribution,  that  enables  one  to  obtain  bounds  on  distributional  differences,  such  as  in 

(2) .  Stein’s  Method  converts  this  problem  to  one  of  bounding  an  expectation  of  a  generator 
of  an  associated  Markov  process. 
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and  Kurtz  1986]  are  useful  references  on  the  material  to  follow.  All  definitions  used  can 
be  found  in  these  references.  Let  {Xt,t  >  0}  be  a  time  homogeneous  Markov  process,  on 
a  state  space  £,  and  suppose  A  is  its  generator.  Define  an  operator  on  a  suitable  class  of 
functions  h  G  X  by  Tth  =  ]E[/i(At)|Ao  =  x)].  Then  the  generator  of  the  Markov  process 
At  is 


Ahix)  =  lim 


\Tth{x)\  -  h{x) 
t 


A  generator  uniquely  characterises  a  Markov  process.  The  key  to  Stein’s  Method  is  the 
following  principle.  For  a  stationary  distribution  /r,  A  =  /t  if  and  only  if  IE[Ah(A)]  =  0 
for  all  h  for  which  Ah  is  defined  [Ethier  and  Kurtz  1986,  Lemma  4.9.1  and  Proposition 
4.9.2].  Thus,  suppose  we  have  a  target  random  variable  A,  and  we  want  to  compare 
its  distribution  with  /r,  as  in  (1).  Then  the  above  implies  that  if  lE[A/i(A)]  0,  then 
the  distribution  of  A  is  approximately  equal  to  where  A  is  the  generator  of  a  Markov 
process  with  stationary  distribution  fj,. 


In  order  to  assess  such  distributional  approximations,  we  attempt  to  find  the  solution  h{x) 
to  the  Stein  equation 

f{x)  -  J  fdia  =  Ah{x),  (3) 

for  some  appropriate  function  /,  and  a  reference  distribution  fj,.  As  an  example,  we  could 
choose  f{x)  =  II[a;eA]j  some  measurable  set  A,  and  where 


^[xdA] 


1  if  X  G  A; 

0  otherwise. 


Then  it  follows,  by  applying  expectations  to  both  sides  of  (3),  that  if  A  is  a  random 
variable. 


IE 


J  ^[x&A]dti 


=  1P(A  G  A)  -  /r(A)  =  IE[Ah(A)]. 


(4) 


Hence,  in  order  to  assess  the  distributional  approximation  of  A  by  fi,  (4)  implies  we  may 
attempt  to  estimate  the  generator  component  ]E[Ah(A)].  Consequently,  since  this  will 
depend  on  the  function  h,  which  is  the  solution  to  the  Stein  equation  (3),  we  need  to 
investigate  whether  we  can  in  fact  solve  for  h  in  (3). 


It  turns  out  that  the  equation  (3)  can  be  solved  generally  [Barbour,  Holst  and  Janson  1992]. 
If  Xt  is  an  ergodic  Markov  process  [Ethier  and  Kurtz  1986]  with  stationary  distribution 
/r,  then  under  some  conditions  on  the  functions  /  and  h,  the  solution  to  (3)  is 


h{x) 


fiXt)  -  J  fdfa 


dt, 


(5) 


where  W,^[f{Xt)]  =  IE[/(At)|Ao  =  x].  The  required  conditions  on  the  functions  /  and 
h,  so  that  (5)  solves  (3),  can  be  established  using  the  general  theory  of  Teller  processes 
[Ethier  and  Kurtz  1986].  One  then  attempts  to  write  the  generator  component  of  the  Stein 
equation  in  terms  of  h,  and  bounds  on  h  are  then  used  to  obtain  bounds  on  the  generator. 
This  is  then  applied  to  (4),  providing  upper  bounds  on  the  distributional  approximation. 


We  provide  some  examples  of  Markov  processes  for  some  implementations  of  Stein’s 
Method.  When  the  reference  distribution  is  Poisson  with  mean  A,  the  appropriate  process 
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is  an  immigration-death  process,  with  immigration  rate  A  and  unit  per  capita  death  rate 
[Ross  1983].  The  generator  is 

Ah{x)  =  A  ^h{x  +  1)  —  h{x)^  —  x  ^h(x)  —  h{x  —  1)^  .  (6) 

In  the  case  of  Gaussian  approximations,  the  process  is  the  Ornstein-Uhlenbeck  diffusion 
process  [Aldous  1989],  with  generator 

Ah{x)  =  h''{x)  —  xh'{x).  (7) 

It  can  be  shown  in  both  cases  that  the  reference  distribution  is  the  appropriate  Markov 
process  stationary  distribution. 

In  many  cases,  the  probabilistic  method  outlined  above  works  very  well,  such  as  general 
random  variable  approximations  [Weinberg  1999,  2000]  and  Poisson  process  approxima¬ 
tions  [Barbour  and  Brown  1992].  However,  there  are  cases,  such  as  compound  Poisson 
approximation  [Barbour,  Chen  and  Loh  1992  and  Barbour  and  Chryssaphinouis  2001], 
where  it  does  not  work.  The  probabilistic  method  is,  however,  not  always  essential.  Stein’s 
original  work  was  not  based  on  the  generator  approach  outlined  here,  nor  was  the  original 
work  of  Chen.  It  does,  however,  provide  the  framework  for  the  construction  of  a  Stein 
equation  for  a  particular  application. 

In  the  following  two  subsections  we  introduce  Stein’s  Method  for  Poisson  and  Gaussian 
approximations . 


2.2  A  Discrete  Case:  Poisson  Approximation 


A  Poisson  random  variable  is  a  model  for  counting  rare  events  [Ross  1983].  It  is  as 
common  a  model  for  such  phenomena  as  the  Gaussian  is  for  common  events.  As  shown  in 
[Weinberg  2005],  it  arises  in  detection  probabilities  for  single  pulses  in  Gaussian  clutter. 
This  subsection  is  concerned  with  the  development  of  Stein’s  Method  for  the  Poisson 
approximation  of  a  sum  of  indicator  random  variables.  This  is  done  to  illustrate  the 
development  of  Stein’s  Method  for  discrete  random  variables.  The  general  application  of 
Stein’s  Method  for  arbitrary  random  variables  can  be  found  in  [Barbour,  Holst  and  Janson 
1992  and  Weinberg  1999,  2000]. 


The  key  to  the  method  in  this  context  is  the  following  characterisation  of  a  random  variable 
VP:  IT  is  Poisson  with  mean  A  if  and  only  if  for  all  bounded  functions  g  with  g{0)  =  0, 

IE[A5(IT  +  1)]  =  IE[IT5(IT)]. 

The  proof  of  necessity  is  quite  straightforward.  To  prove  sufficiency,  one  attempts  to  find 
a  unique  solution  to  the  equation 


+  1)  -  jg{j)  =  II[jeA]  -  Po(A){A}, 
for  each  j  G  IN,  where  A  C  IN  and 


I[ieA] 


1  if  j  G  A; 

0  otherwise. 


(8) 
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Equation  (8)  is  called  the  Stein  equation  for  Poisson  approximation.  By  introducing  a 
functional  transformation  of  g{j)  =  h{j)  —  h{j  —  1),  the  left  hand  side  of  (8)  transforms 
to  the  generator  (6).  If  a  unique  solution  to  (8)  can  be  found,  then  for  a  random  variable 
W  with  mean  A,  it  follows  that 


TE[\g{W  +  1)  -  Wg{W)]  =  W{W  e  A)  -  Po(A){^}.  (9) 


The  idea  is  that  the  Poisson  approximation  of  W  can  be  determined  by  bounding  the 
expectation  in  (9).  As  an  example,  suppose  the  random  variable  bP  is  a  sum  of  indicators, 
and  so  takes  the  form 

n 

W  =  Y,X,,  (10) 

where  each  Xj  is  independent  and  distributed  according  to 


1  with  probability  pj ; 

0  with  probability  1  —  Pj- 


Random  variable  bP  is  a  generalisation  of  a  Binomial,  and  is  called  a  Multinomial  distri¬ 
bution.  We  are  interested  in  the  Poisson  approximation  of  bP.  Let  A  =  lE[bP]  =  YTj^iPj, 
and  define  a  random  variable  Wj  =  W  —  Xj.  Then  we  have  the  following  Theorem: 


Theorem  2.1  For  the  random  variable  W  defined  above,  and  for  any  set  A  C  IN, 

\nw  G  A)  -  Po(A){A}|  < 

Thus,  the  bound  of  Theorem  2.1  gives  the  rate  of  convergence  of  the  distribution  of  bP 
to  a  Poisson  limit.  We  now  outline  the  proof  of  Theorem  2.1.  For  technical  reasons,  we 
assume  the  solution  to  the  Stein  equation  (8)  satisfies  ^(O)  =  0.  It  can  be  shown  [Barbour, 
Holst  and  Janson  1992]  that  a  unique  solution  exists  to  (8),  and  is  given  by 

gU  +  1)  =  A-^- V!e^(Po(A){A  n  U,}  -  Po(A){A}Po(A){[/,}),  (11) 

where  Uj  :=  {0, 1, . . . ,  j}.  Furthermore,  the  differences  of  g  are  uniformly  bounded: 

sup|ff(j-bl)-ff(j)|  <  — .  (12) 

jeiN  ^ 

A  proof  of  (12)  can  also  be  found  in  [Barbour,  Holst  and  Janson  1992],  or  as  a  consequence 
of  the  results  in  [Weinberg  2000].  We  now  manipulate  the  expectation  of  (9).  Observe 
that 

n 

lE[Aff(lP  +  1)]  =  +  Xj).  (13) 

i=i 

Also 

n  n 

nwg{w)]  =  j2nxjg{Wj+Xj)]  =  j2pj^9iWj  + 1).  (u) 

i=i  i=i 
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Hence,  an  application  of  (13)  and  (14)  to  (9)  yields 


n^g{W  +  l)-Wg{W)]  =  ^p,IE[5(1H  +  1)-5(1H,  +  1)] 

n 

=  J2PjnXMWj  +  2)-g{Wj  +  l)]] 

n 

=  Y.p]n9{Wj  +  2)-g{Wj  +  l)]. 
i=i 

By  applying  the  bound  (12)  to  (15),  we  obtain 

n 

|P(1H  G  H)  -  Po(A){H}|  <  snv\g{k  +  l)-g{k)\Y,p] 

i=i 


fceiN 


-,-A  ri 


^  1  c  2 
<  - ^ 


A 


i=i 


which  completes  the  proof  of  Theorem  2.1. 

Note  that  by  choosing  pj  =  p  for  each  j,  and  using  the  fact  that  A  =  np, 

\W{W  gA)-  Po(A){H}|  <  p{l  -  e-^P), 


(15) 


(16) 


(17) 


implying  the  rate  of  convergence  of  a  Binomial  distribution  with  parameters  n  and  p,  to 
a  Poisson  distribution  with  mean  np,  is  of  order  p. 


In  the  following  subsection  we  outline  the  way  Stein’s  Method  is  used  for  continuous 
distributions. 


2.3  A  Continuous  Case:  Gaussian  Approximation 


The  Gaussian  distribution  has  been  used  widely  by  radar  analysts  as  a  model  for  clut¬ 
ter  and  noise  [Levanon  1988].  Due  to  its  importance,  and  as  an  indication  of  the  way 
Stein’s  Method  is  used  for  continuous  distributions,  we  consider  the  case  of  Gaussian  ap¬ 
proximation.  The  following  development  is  taken  from  [Chen  and  Shao  2003  and  Stein 
1986].  In  the  Spirit  of  [Stein  1972,  1986],  we  derive  a  Berry-Esseen  type  bound  [Esseen 
1945,  Eeller  1968]  on  the  Gaussian  distributional  approximation  of  a  sum  of  independent 
random  variables. 

Throughout  we  will  let  Z  =  Gau(0, 1)  be  a  standard  Gaussian/Normal  random  variable, 
with  mean  0  and  variance  1,  with  distribution  function 

!i>(2) = 4=  r 

V  Ztt  J —oo 


Eollowing  is  the  main  result: 
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Theorem  2.2  Let  {Xj,j  G  IN}  be  a  sequence  of  independent  random  variables  with  mean 
zero,  and  let  W  =  J2^=iXj.  Suppose  Wj\Xj  P  <  oo  and  J2^=i^Xj  =  1.  Then  for  any 
absolutely  continuous  function  h  satisfying  sup^,  \  h'(x)\  <  c, 

n 

\lEh{W)-lEh{Z)\  <3c^]E|Xj|3. 

i=i 

Theorem  2.2  does  not  enable  a  central  limit  result  to  be  obtained  because  it  is  a  smoothness 
estimate  for  a  class  of  absolutely  continuous  functions.  Indicator  functions  are  obviously 
not  included  in  this  class.  However,  it  allows  estimates  to  be  made  under  metrics  weaker 
than  total  variation  based  ones.  A  discussion  of  weaker  metrics,  together  with  their  merits, 
can  be  found  in  [Weinberg  1999].  Considerably  more  analysis  is  required  to  extend  the 
work  presented  here  to  derive  total  variation  bounds.  However,  since  the  focus  is  on 
illustrating  the  application  of  Stein’s  Method,  it  is  worth  examining  Theorem  2.2. 

Based  upon  [Stein  1972],  the  Stein  equation  for  Gaussian  approximation  is 

g'{x)  -  xg{x)  =  h{x)  -  $(z),  (18) 

where  is  the  standard  Gaussian  cumulative  distribution  function,  and  h{x)  is  a  suit¬ 
able  test  function.  Notice  by  a  transformation  of  h' [x)  =  g{x),  the  left  hand  side  of  (18) 
becomes  the  generator  (7).  In  the  cases  where  we  are  interested  in  distributional  approx¬ 
imations  under  a  total  variation  norm,  a  suitable  choice  for  h  is  h{x)  =  If  tliis  is 

the  case,  then  if  W  is  a  random  variable,  (18)  implies 

lEb'(IH)  -  Wg{W)]  =  F(1T  <  z)  -  $(z).  (19) 

As  in  the  Poisson  case,  we  attempt  to  derive  bounds  on  the  expectation  component  of 
(19). 

It  can  be  shown  that  a  unique  solution  exists  to  (18),  and  is  given  by 

^2  pX  ^2 

g{x)  =  e~^  /  [h{w)  —  $(z)]e“^ dw.  (20) 


Let  {Xj,j  G  IN}  be  a  sequence  of  independent  (continuous)  random  variables,  with  zero 
mean  and  also  impose  the  condition  that  =  I-  As  in  the  Poisson  case,  we 

consider  W  =  Xj,  and  define  Wj  =  W  —  Xj.  Also  define  Kj{t)  =  IE[A'j[lI[o<t<Xj]  ~ 

II[X^-<i<0]]]- 

It  can  be  shown  that  Kj  satisfies  the  following  three  properties: 

Kj{t)  is  nonnegative; 


Kj{t)dt  =  IEX|; 
\t\Kj{t)dt  =  lE\X^\/2. 
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Let  /i  be  a  measurable  function  that  is  absolutely  integrable  with  respect  to  the  standard 
Ganssian  density,  so  that  IE|h(Z)|  <  oo.  Since  Xj  and  Wj  are  independent,  and  EXj  =  0, 

n 

W.Wg{W)  = 

n 

=  Y.^Xj[giW)  -  g{Wj)] 

i=i 


=  /  g'{Wj+t)dt 

j=i  JO 

^  POO 

^E  /  g\Wj+t)Kj{t)dt. 

■  1  J —  OO 


(21) 


i=i 


Also,  since 


it  is  clear  that 


n  n 

Y,  /  K^{t)dt  =  Y^x]  =  i, 

j=i 


(22) 


(23) 


poo 

^g'{W)  =  Y^  giW)Kj{t)dt. 
i=i 

Hence,  by  combining  (21)  and  (22),  it  follows  that 

^  poo 

na'iW)  -  Wg{W)]  =  ^E  /  [g'{W)  -  g'{Wj  +  t)]Kj{t)dt. 

j=i 

It  is  shown  in  [Chen  and  Shao  2003]  that  |5^^(a;)|  <  2c.  Hence,  by  an  application  of  the 
Mean  Value  Theorem  to  the  integrand  of  (23), 

^  poo 

\^[g'{W)-Wg{W)]\  <  2cY^  m  +  \Xj\)Kj{t)dt 

■  1  J —OO 


=  2c^(E|X||/2  +  E|Xj|EX: 

n 

<  scYnxfi- 

i=i 


(24) 


This  completes  the  proof  of  Theorem  2.2. 


Theorem  2.2  provides  a  Berry-Esseen  type  bonnd  on  the  Gaussian  approximation  of  a  sum 
of  random  variables,  but  with  a  lot  more  effort,  one  can  show  the  following  using  similar 
techniques: 

Theorem  2.3  Let  {^j,j  G  IN}  be  a  sequence  of  independent  random  variables  with  mean 
zero,  Ej^jp  <  oo  and  Y^^=i  E^j^  =  1.  Then 

\\C{W)  -  C{Z)\\  =  sup  |E(1V  <z)-  $(^)|  <  2.2V4/3i  +  3/?2, 
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where  C  denotes  probability  law  or  distribution,  ||  •  ||  is  the  total  variation  norm,  Pi  = 

j:]=inx]^x,\>i]]  andP2  =  E”=iIE[|^,flI[|x,|<i]]- 


Theorem  2.3  gives  a  bound  on  the  rate  of  convergence  of  the  distribution  of  a  sum  of 
random  variables  to  a  Gaussian  limit,  and  so  is  a  central  limit  theorem.  Many  other  such 
bounds  can  be  found  in  [Stein  1972,  1986]. 
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3  Stein’s  Method  for  Exponential 
Approximation 


The  purpose  of  the  following  is  to  discuss  the  development  of  Stein’s  Method  for  distri¬ 
butional  approximations  of  random  variables  by  an  Exponential  distribution.  There  are 
many  processes  of  interest  that  can  be  approximated  by  such  a  distribution.  Examples 
include  the  time  it  takes  for  a  stochastic  process  to  enter  a  set  of  particular  states  [Aldous 
1989],  and  the  arrival  time  of  an  event  in  a  stochastic  process  [Ross  1983].  In  a  radar  con¬ 
text,  there  are  a  number  of  distributions  of  interest  that  are  approximately  Exponential. 
The  time  to  a  detection  in  a  constant  false  alarm  rate  detection  process  is  roughly  Expo¬ 
nential,  possibly  dependent  on  the  clutter  assumptions.  In  radar  imaging  systems  such  as 
synthetic  aperature  radar,  some  speckle  models  are  also  approximately  Exponential. 

An  Exponential  random  variable  is  a  particular  case  of  a  Gamma,  and  Stein’s  Method  has 
been  developed  for  the  latter  in  [Luk  1994].  The  Stein  equation  for  Gamma  approximations 
can  be  found  in  [Reinert  2003],  who  considers  the  approximation  by  distributions.  The 
latter  is  also  intimately  related  to  the  Gamma  distribution.  In  the  case  of  Exponential 
approximations,  a  much  simpler  form  of  the  Stein  equation  is  available.  In  the  following 
we  introduce  this  Stein  equation,  and  show  how  it  can  be  used  in  a  simple  context. 


3.1  The  Stein  Equation 


Recall  that  a  continuous  random  variable  W  is  said  to  have  an  Exponential  distribution 
with  positive  parameter  A  if  its  density  is  fw{x)  =  Ae“^*,  for  a:  >  0.  Its  cumulative 
distribution  function  is  Fw{x)  =  1  —  e~^^.  It  can  be  shown  that  the  mean  value  of  W 
is  j,  while  its  variance  is  We  attempt  to  find  an  equation  which  characterises  this 
distribution.  In  light  of  the  form  of  the  Stein  equation  for  Gaussian  approximations,  we 
examine  expressions  for  an  expectation  of  derivatives  of  smooth  functions  in  this  variable. 
We  also  provide  a  more  detailed  discussion  than  the  examples  considered  in  the  previous 
section.  This  is  due  to  the  fact  that  the  work  presented  here  is  new,  and  still  under 
development. 


Suppose  C  is  the  class  of  all  continuous  and  piecewise  differentiable  functions  that  map 
the  real  numbers  to  themselves.  Since  we  will  be  looking  at  the  solution  of  differential 
equations,  we  define  a  subset  C*  C  C,  of  all  functions  in  C  whose  domain  is  the  nonnegative 
real  line,  and  that  satisfy  the  two  boundary  conditions  that  g{0)  =  0  and^  g{x)  =  o{e^^) 
as  a:  — >  oo.  Then  if  W  is  an  Exponentially  distributed  random  variable,  with  parameter 
A,  and  g  G  C*, 

poo 

]Eg'{W)  =  /  g'{x)Xe-^^dx 

Jo 


g{x)\e 


—Ax 


+ 


g{x)\^e  ^^dx 


^The  notation  f{x)  =  o{g{x))  as  a;  — >■  oo  means  that  lim 


fix) 

gix) 


=  0. 
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POO 

=  A  /  g{x)Xe~^^dx 
Jo 

=  AIEff(M7).  (25) 

Hence,  for  any  g  G  C*,  if  W  =  Exp(A),  then  lEgr'(lH)  =  A]E5(H7).  What  is  less  apparent 
is  that  the  converse  of  this  statement  is  also  true.  That  is,  if  (25)  holds  for  all  functions 
g  ^  C* ,  then  this  implies  that  W  is  Exponentially  distributed. 

To  demonstrate  this,  consider  the  equation 

g'{x)  -  Xg{x)  =  -  Exp(A){a},  (26) 

where  the  indicator  function  II[a;<a]  is  defined  as  previously,  and  Exp(A){a}  is  the  cumu¬ 
lative  probability  distribution  of  an  Exponential  random  variable,  with  mean  A,  on  the 
interval  [0,  a],  for  some  a  >  0.  Hence  Exp(A){a}  =  1  — The  differential  equation  (26) 
is  the  Stein  equation  for  Exponential  approximations.  By  a  simple  functional  transfor¬ 
mation,  it  can  be  shown  to  be  equivalent  to  the  corresponding  Stein  equation  in  [Reinert 
2003].  We  need  to  show  there  is  a  unique  and  well  defined  solution  to  (26).  Then  if  W  is 
some  random  variable  with  support  the  nonnegative  real  line, 

fE[g'{W)  -  Xg{W)]  =  fP{W  <  a)  -  Exp(A){a}.  (27) 

As  in  the  Poisson  and  Gaussian  cases,  bounding  the  expectation  term  in  (27)  will  give 
bounds  on  the  Exponential  approximation  of  W.  The  next  subsection  is  concerned  with 
the  existence  and  uniqueness  of  the  solution  to  (26). 


3.2  Solution  to  Stein  Equation:  Existence  and  Uniqueness 


It  is  a  well-known  fact  that  a  first  order  linear  differential  equation  of  the  form  g'{x)  — 
Xg{x)  =  k{x),  with  k  a  function,  and  boundary  conditions  ^(0)  =  tti  and  g'{Q)  =  t^2  has 
a  unique  solution  [Kreyszig  1988,  Theorem  1,  Section  2.9].  To  construct  the  solution  to 
(26),  multiply  both  sides  of  the  Stein  equation  (26)  by  and  observe  that 


d 


e-^^g'ix)  -  Xe-^^g{x)  =  ^^-""^(x). 


(28) 


from  the  product  rule  for  differentiation.  Hence,  by  integration  and  applying  (28)  to  (26), 
we  obtain 


g{x)  =  e 


Xx 


^—Xt 


2[t<al  -  Exp(A){a} 


dt. 


(29) 


Note  that  (29)  satisfies  ^(0)  =  0  and  lim2;_^oo  e~^^g{x)  =  0.  In  order  to  show  that  g 
defined  by  (29)  is  well  defined,  we  show  that  it  is  bounded  in  the  supremum  norm  on  the 
nonnegative  real  numbers.  It  is  useful  to  firstly  simplify  (29).  It  is  not  difficult  to  show 
that 

rxAa 


g{x)  =  e 


Xx 


^—Xt 


dt  —  {1  —  e 


—Xa\ 


^Xx 


=  -p 


3— A(xAa) 


[1 


^—Xa 


][i 


e  ^^dt 


Axi 


(30) 
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where  a;  A  a  is  the  minimum  of  x  and  a.  By  considering  the  two  cases  where  x  Aa  =  x  and 
a;  A  a  =  a,  it  can  be  shown  that  (30)  is  equivalent  to 

g{x)  =  j  -  e-^“)  .  (31) 

Consequently,  it  follows  that 

1  _  p-Aa 

II5II  =  sup  |5r(a;)|  <  - - - .  (32) 

x>Q  A 

This  shows  that  the  solution  (29)  to  the  Stein  equation  (26)  is  well  defined.  Hence  g  in 
(31)  is  the  uniquely  well  defined  solution  to  the  Stein  equation  (26). 

The  function  g  is  continuous  and  piecewise  differentiable,  and  so  5  G  C*,  since  it  satisfies 
the  appropriate  boundary  conditions.  The  only  point  on  the  nonnegative  real  line  where 
it  is  not  differentiable  is  at  a;  =  a.  In  fact,  it  is  not  difficult  to  show  that 

gA(x-a)  if  X  <  a; 

g'{x)  =  <  0  if  X  >  a;  (33) 

undefined  if  x  =  a. 

V 

Given  a  random  variable  W ,  in  order  to  assess  its  closeness  in  distribution  to  an  Expo¬ 
nential  random  variable,  we  can  estimate  the  expectation  JE[g'{W)  —  Xg{W)]  and  see  how 
close  to  zero  it  is.  We  illustrate  this  in  the  next  subsection  using  a  very  simple  application. 


3.3  Rate  of  Convergence  of  a  Truncated  Exponential  Dis¬ 
tribution 

Consider  the  truncated  Exponential  distribution,  with  density  /(x)  =  ,  for  some 

0  >  0,  and  for  x  G  [0,0].  Clearly,  as  0  — >  00,  this  density  converges  to  that  of  an 
Exponential  random  variable.  We  use  this  example  to  illustrate  the  application  of  Stein’s 
Method  for  Exponential  approximations.  We  basically  need  to  estimate  the  expectation 
M[g'{W)  —  \g{W)],  where  W  has  the  density  /  above. 

Note  that,  by  an  application  of  integration  by  parts, 

/■oo  \p~^^ 

W.g'{W)  =  g'{x).^-^dx 

=  i_p-Ae  +  A(1  -  e-^^)lE5(W)]  .  (34) 

Hence  it  follows  that 

ng'iW)  -  Xg{W)]  =  (35) 
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An  application  of  the  smoothness  estimate  (32)  to  (35)  yields 


|P(Vh  <  a)  -  Exp(A){a}|  =  \^[g'{W)  -  \g{W)] \ 


< 


< 


1  —  e 


-\e 


1  —  e 


-\e  • 


(36) 


The  bound  (36)  gives  the  rate  of  convergence  of  the  truncated  Exponential  distribution, 
to  a  standard  Exponential  distribution.  As  expected,  this  bound  decreases  to  zero  as  9 
increases  without  bound. 


3.4  Towards  a  General  Bound 


This  section  outlines  how  the  Exponential  Stein  equation  (26)  can  be  used  to  derive  a 
general  bound  on  the  Exponential  approximation  of  a  continuous  random  variable  W .  At 
this  stage,  the  work  is  in  need  of  further  development.  However,  some  progress  has  been 
made  toward  establishing  a  useful  result. 


We  are  interested  in  the  distributional  approximation  of  W  by  an  Exponential  random 
variable  with  parameter  A.  Hence,  we  assume  lEW  =  j.  Eirstly,  as  in  the  derivation  of 
(21)  for  the  Gaussian  case,  note  that  we  can  write 


rw 

JEg{W)  =  IE  /  g'{t)dt 

Jo 

poo 

=  IE  /  JL[w>t]g'{t)dt 


=  /  P(IT  >  t)g'{t)dt. 

Jo 

It  can  be  shown  [Billingsley  1986]  that  for  a  continuous  random  variable  W, 

roo 

JEW=  /  JP{W>t)dt. 

Jo 

Hence,  an  application  of  the  above  results  in 
1 


1 

-g'{W)  =  [WW]g'{W)  =  /  W{W  >  t)g'{W)dt. 
A  Jo 


Taking  expectations  in  the  above  yields 


nTd'iw)]  =  /  p(w  >  t)ng'iw)]dt. 
A  Jo 


(37) 


(38) 
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Consequently,  by  combining  both  (37)  and  (38),  we  obtain 
n9'{W)-Xg{W)\  = 


/  P(PC  >  t)dt 
Jo 

What  is  now  required  is  a  bound  on  ]Eg'{W)  —  g'{t),  such  that  it  does  not  depend  on  t. 
Suppose  we  have  a  bound  of  the  form 

\^g'iW)-g'it)\<lEC{W,a),  (40) 

where  C  is  some  function,  and  ]E((^(W,a)  is  a  bound  on  the  moments  of  W.  Then  an 
application  of  (40)  to  (39)  will  yield 

P(W  <  a)  -  Exp(A){a}  =  IE[ff'(W)  -  \g{W)]  <  ]EC(W4,  a).  (41) 

The  function  ^  may  in  fact  be  a  uniform  bound  on  the  differences  of  the  derivatives  of  the 
Stein  equation  (31).  Specifically,  based  upon  (33),  it  may  be  possible  to  establish  a  bound 
on  g'{x)  —  g'{y).  As  an  example,  it  can  be  shown  that 

Ha'iW)  -  g'{t)]  <  1  A  (e-“]Ee^^)  =  1  A  (e-“]Mvv(A)), 

where  ]M^(A)  is  the  moment  generating  function  of  W  [Billingsley  1986].  It  may  then  be 
possible  to  use  properties  of  the  moment  generating  function  to  produce  a  useful  bound. 
It  will  also  be  necessary  to  extract  lower  bounds,  of  a  similar  form,  in  order  to  obtain 
bounds  on  a  norm  of  ]E^(W,  a).  These  ideas  will  be  pursued  in  future  work. 


xnjg'iw)  -  g{W)] 

poo 

X  /  JP{W  >  t)  [lEff'(lT)  -  g'{t)]  dt 
Jo 

poo 

/  nW>t)[lEg'{W)-g'{t)]dt 

^ - ^oo - •  (39) 
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4  Potential  Radar  Applications 


The  purpose  of  this  final  section  is  to  identify  a  number  of  random  variables  and  stochastic 
processes  of  interest,  in  a  radar  context,  and  to  indicate  how  Stein’s  Method  can  be  used  to 
assess  distributional  approximations  for  these.  It  is  hoped  that  these  potential  applications 
will  generate  some  interesting  radar  related  research.  These  applications  fall  under  the  two 
main  categories  of  radar  detection  and  inverse  synthetic  aperture  radar.  There  are  many 
more  problems  where  Stein’s  Method  could  be  employed,  especially  in  the  areas  of  clutter 
statistics  and  detection  theory.  As  an  aside,  we  point  out  that  the  ideas  developed  in  the 
context  of  Exponential  approximation  could  be  applied  to  some  applications  in  critical 
phenomena  in  physics.  Specifically,  there  is  scope  to  apply  the  development  in  Section  3 
to  [Abadi  2001]. 


4.1  Assessing  Approximations  of  Clutter  Statistics  in  De¬ 
tection 


Radar  detection  theory  deals  with  the  problems  associated  with  deciding  whether  there  is 
a  target  present  in  a  noisy  environment  [Hippenstiel  2002] .  An  important  type  of  detector 
is  the  constant  false  alarm  rate  (GEAR)  detection  scheme  [Levanon  1988  and  Minkler  and 
Minkler  1990].  At  each  stage,  a  set  of  clutter  statistics  are  extracted,  from  which  an  average 
measure  of  the  clutter  is  obtained.  This  is  then  compared  to  a  test  cell  observation.  If  this 
test  cell  observation  exceeds  a  normalised  clutter  threshold,  a  target  is  declared  present. 
Otherwise,  there  is  no  target  in  the  cell  under  test.  A  performance  measure  of  such  a 
scheme  is  the  probability  of  false  alarm.  For  a  particular  GEAR  scheme  one  would  like  to 
be  able  to  estimate  this  probability.  This  is  usually  done  by  directly  applying  Monte  Garlo 
techniques  [see  Weinberg  2004  and  references  contained  therein].  In  many  cases  Monte 
Garlo  estimation  is  problematic,  and  so  it  may  be  useful  to  be  able  to  approximate  the 
clutter  statistic  firstly. 

To  illustrate  this  idea,  we  suppose  we  have  m  independent  and  identically  distributed 
clutter  statistics  Ai,  A2, . . .  ,Xm,  and  the  cell  under  test  statistic  is  Xq.  If  r  is  the  GEAR 
threshold  parameter,  then  the  probability  of  false  alarm  is 

JPfa  =  Jp(xo>  -fiXi,X2,...,Xm)),  (42) 

where  /  determines  the  level  of  the  clutter,  and  Aq  has  the  same  distribution  as  the  clutter 
statistics  [see  Weinberg  2004].  There  are  many  choices  available  for  /.  In  the  case  of  cell 
averaging  GEAR,  /(Ai,  A2, . . . ,  A^)  =  Ai  +  A2  +  •  •  •  +  A^.  Another  common  choice  is  the 
smallest-of  GEAR,  where  /(Ai,  A2, . . . ,  A^)  =  min{Ai  +  •  •  •  +  Aj.,  A^+i  +  •  •  •  +  A^},  for 
some  1  <  k  <  m.  In  a  Monte  Garlo  estimation  of  (42),  one  would  find  a  suitable  importance 
sampling  estimator,  since  direct  standard  Monte  Garlo  estimation  of  a  probability  such 
as  (42)  requires  an  enormous  number  of  simulations  [Weinberg  2004].  In  some  cases, 
importance  sampling  can  also  be  problematic.  A  possible  remedy  to  this  is  to  approximate 
the  distribution  of  the  clutter  statistic  /(Ai,  A2, . . . ,  Xm)  and  then  use  this  together  with 
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Monte  Carlo  estimation  of  (42).  This  idea  is  nsed  in  [Weinberg  2004]  to  a  certain  degree. 
The  qnestion  of  the  validity  of  the  distributional  approximation  of  f{Xi,X2,...,Xm) 
naturally  arises.  The  clutter  function  /  can  be  written,  in  many  cases,  as  a  sum  of 
random  variables.  In  the  case  of  the  cell  averaging  clutter  measure,  this  is  obvious.  In  the 
case  of  the  smallest-of  clutter  statistic,  we  can  write  it  as 

k 

f{Xi,X2,...,Xm)  =  Yl^i^Xl+-+Xk<Xk+i+-+X^] 

j=^ 

m 

+  E  lXi+-+Xk>Xk+i+-+x^]-  (43) 

j=k+l 

The  random  variable  sum  in  (43)  is  hence  just  a  sum  of  dependent  random  variables. 
Hence,  for  a  particular  clutter  scenario  and  specific  clutter  measure  /,  we  are  interested 
in  the  distributional  approximation  of  a  sum  of  random  variables.  As  has  been  illustrated 
in  this  report,  Stein’s  Method  has  been  adapted  to  such  problems,  and  there  are  many 
bounds  on  the  accuracy  on  such  distributional  approximations.  Thus  Stein’s  Method  could 
be  used  to  assess  the  validity  of  approximations  made  to  facilitate  Monte  Carlo  estimation 
of  radar  performance  measures. 


4.2  Time  to  Detection  in  Constant  False  Alarm  Rate  Scheme 


The  time  it  takes  for  a  detection  in  CFAR,  or  more  specifically,  the  time  between  two 
detections,  is  of  importance  in  radar  detection  because  it  is  the  correlation  of  such  events 
that  indicates  the  presence  of  a  target.  From  a  statistical  distribution  point  of  view,  it 
would  be  a  rather  complex  random  variable,  with  a  possibly  long  tailed  distribution.  Since 
detections,  in  a  sense,  are  like  rare  events,  the  waiting  time  is  roughly  like  the  waiting  time 
to  a  rare  event.  A  Poisson  process  [Ross  1983]  can  be  used  as  a  model  for  counting  the 
occurrences  of  rare  events,  and  the  waiting  time  in  such  a  process  has  an  Exponential 
distribution.  Hence,  as  a  first  order  approximation,  an  Exponential  distribution  could  be 
used  for  the  waiting  time  to  a  detection. 

To  formulate  this  idea  mathematically,  suppose  we  have  a  discrete  time  stochastic  process 
{Xj,j  G  IN},  where  each  Xj  is  an  indicator  random  variable,  determined  at  each  CFAR 
detection  decision,  defined  by 


1  if  tj  <  Tj\ 
0  otherwise. 


(44) 


For  each  j,  Tj  is  the  CFAR  threshold,  and  tj  is  the  cell  under  test  statistic.  Hence  Xj  is 
one  if  there  has  been  no  detection  at  the  jth  timepoint.  Then  we  define  a  random  variable 
kF  as 


W  =  inf  {A,-  =  0}. 
ieiN 


(45) 


Random  variable  W  is  the  first  time  the  stochastic  process  hits  state  0,  and  so  counts  the 
discrete  time  it  takes  for  a  detection.  Although  kF  is  a  relatively  simple  random  variable. 
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it  is  distributionally  complex.  However,  a  number  of  approximations  could  be  made.  Note 
that  W  is  the  number  of  failures  before  the  first  success,  in  a  series  of  dependent  Bernoulli 
trials  [Billingsley  1986],  with  a  varying  success  probability.  Hence,  as  a  first  approximation, 
it  is  roughly  a  Geometric  random  variable.  Stein’s  Method  could  be  used  to  assess  this 
approximation,  using  the  work  of  [Pekoz  1996,  Phillips  1996  and  Phillips  and  Weinberg 
2000]. 

Alternatively,  since  the  random  W  can  be  viewed  as  the  discretised  version  of  the  waiting 
time  to  a  rare  event,  it  could  in  turn,  be  approximated  by  an  Exponential  random  variable. 
The  work  of  Section  3,  when  developed  further,  could  be  used  to  assess  this  approximation. 


4.3  Speckle  Modelling  in  Inverse  Synthetic  Aperature  Radar 


Inverse  Synthetic  Aperature  Radar  (ISAR)  is  a  useful  technique  for  the  acquisition  of  high 
resolution  images  of  a  target  of  interest  [Curlander  and  McDonough  1991  and  Oliver  and 
Quegan  1998].  Speckle  is  a  phenomenon  found  in  coherent  imaging  systems,  such  as  ISAR. 
In  ISAR,  both  the  amplitude  and  the  phase  of  the  backscattered  radiation  are  recorded. 
Consequently,  a  grainy  noise  like  characteristic  is  introduced.  The  latter  is  referred  to  as 
speckle.  Each  resolution  cell  of  the  system  contains  many  scatterers.  The  phase  of  each 
returned  signal  from  each  scatterer  is  random,  and  the  correlation  between  these  cause 
speckle. 

[Daba  and  Bell  1994]  examine  partially  developed  speckle,  modelling  the  surface  scat¬ 
tering  statistics  as  a  marked  point  process  [Daley  and  Vere- Jones  1988].  Consequently, 
approximations  are  derived  for  the  distributions  of  intensity  of  a  single  look  and  multi- look 
speckle,  using  orthonormal  Laguerre  polynomial  representations. 


In  their  model,  it  is  assumed  that  a  resolution  cell  contains  a  fixed  number  n  of  point 
scatterers,  randomly  distributed  throughout  the  resolution  cell,  such  that  each  scatterer’s 
position  is  independent  of  that  of  other  scatterers.  Each  backscatter  electric  field  compo¬ 
nent  has  a  constant  amplitude  Aj  (j  G  {1,  2, . . .  ,n})  and  a  random  phase  where  the 
latter  is  assumed  to  be  uniformly  distributed  over  the  interval  [0, 27r).  The  overall  single 
look  speckle  intensity  measurement  is 


Sk 


E 

i=i 


A. 


(46) 


Multi-look  speckle  is  modelled  as  a  noncoherent  sum  of  L  statistically  independent  single 
realisations  of  the  single  look  speckle  intensity: 


T(n,L) 


i=i 


(47) 


It  is  shown  in  [Daba  and  Bell  1994]  that  (46)  has  an  asymptotic  Exponential  distribution, 
while  (47)  has  an  asymptotic  Gamma  distribution.  An  application  of  the  work  in  Section  3, 
in  conjunction  with  the  development  in  [Luk  1994  and  Reinert  2003],  could  yield  conditions 
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under  which  the  limiting  distributions  exist,  and  the  rate  of  convergence  of  (46)  and  (47) 
to  their  respective  asymptotic  distributions. 

Point  processes  [Daley  and  Vere-Jones  1988]  are  a  natural  tool  to  use  in  the  stndy  of 
imaging  problems,  since  they  model  the  distribution  of  points  within  a  region  of  space. 
Stein’s  Method  has  been  developed  extensively  for  Poisson  point  process  approximation 
[Barbour  and  Brown  1992  and  Barbonr,  Holst  and  Janson  1992],  and  since  a  Poisson  point 
process  is  a  model  of  completely  random  scattering  of  points,  the  Poisson  approximation 
of  processes  such  as  (47)  could  also  be  investigated. 

In  addition,  since  (47)  has  a  limiting  Gamma  distribution,  the  idea  of  Gamma  point 
processes  conld  be  developed,  together  with  Stein’s  Method  for  point  process  approxima¬ 
tions  in  this  setting.  This  would  introduce  a  completely  new  process  into  the  class  of  point 
processes,  as  well  as  a  new  development  of  Stein’s  Method. 
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5  Conclusions 


This  report  introduced  Stein’s  Method  in  general  terms,  and  then  developed  its  application 
to  Poisson  and  Gaussian  approximations.  The  ongoing  development  of  Stein’s  Method 
for  Exponential  approximation  was  examined,  and  its  application  illustrated  in  a  simple 
context.  A  significant  amount  of  effort  is  still  required  to  obtain  useful  general  results. 

The  application  of  Stein’s  Method  to  two  areas  of  radar  research  were  outlined.  Problems 
in  radar  detection  and  ISAR  have  been  identified,  where  the  techniques  presented  here 
could  be  applied.  Stein’s  Method  has  the  potential  to  provide  some  interesting  results  on 
the  rates  of  convergence  of  radar  related  processes.  It  is  hoped  that  this  report  instigates 
the  development  and  application  of  Stein’s  Method  to  defence  applications. 
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Appendix  A:  Mathematical  Notation 


Natural  numbers  {0, 1,2,.. .}. 

Real  numbers. 

Positive  real  numbers. 

Probability. 

Expectation  with  respect  to  P. 

Class  of  all  real  valued  continuous  and  piecewise  differentiable  functions. 

Subclass  of  C,  defined  on  page  12. 

Indicator  function:  lupAl  =  I  n  ^ 

\  0  otherwise. 

Equality  in  distribution:  X  is  equivalent  to  P(X  e  ^)  =  P(E  €  for  all  sets  A. 
Moment  generating  function  /  Laplace  transform,  defined  on  page  16. 

Poisson  Distribution  with  mean  A:  if  X  =  Po(A),  then 
P(X  =  j)  =  for  all  j  G  IN. 

Cumulative  Poisson  probability  on  set  A  C  IN. 

Exponential  Distribution  with  mean  if  X  =  Exp(A),  then 
P(X  eA)  =  f^  Xe-^^dx,  for  ^4  c  IR+  U  {0}. 

Cumulative  Exponential  probability  on  interval  [0,  a]. 

Standard  Gaussian  (or  Normal)  distribution,  with  cumulative  distribution 
function  defined  on  page  8. 
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A  Generator  of  a  Markov  Process,  defined  on  page  5. 
sup^  Supremum  on  set  A. 
infyi  Infimum  on  set  A. 

II  •  II  Total  variation  norm:  ||g||  =  sup^.^^^  |5(2:)|- 
a  V  6  Maximum  of  a  and  b. 
a  Ab  Minimum  of  a  and  b. 
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